Tarcil: Reconciling Scheduling Speed and Quality in Large Datacenters
نویسندگان
چکیده
Scheduling diverse applications in large, shared clusters is particularly challenging. Recent research on cluster scheduling focuses either on scheduling speed, using sampling to quickly assign resources to tasks, or on scheduling quality, using centralized algorithms that search for the resources that improve both task performance and cluster utilization. We present Tarcil, a distributed scheduler that targets both scheduling speed and quality. Tarcil uses an analytically derived sampling framework that adjusts the sample size based on load, and provides statistical guarantees on the quality of allocated resources. It also implements admission control when sampling is unlikely to find suitable resources. This makes it appropriate for large, shared clusters hosting shortand long-running jobs. We evaluate Tarcil on clusters with hundreds of servers on EC2. For highly-loaded clusters running short jobs, Tarcil improves task execution time by 41% over a distributed, sampling-based scheduler. For more general scenarios, Tarcil achieves near-optimal performance for 4x and 2x more jobs than sampling-based and centralized schedulers respectively.
منابع مشابه
Tarcil: High Quality and Low Latency Scheduling in Large, Shared Clusters
Scheduling diverse applications in large, shared clusters is particularly challenging. Recent research on cluster management focuses either on scheduling speed, using sampling techniques to quickly assign tasks to resources, or on scheduling quality, using centralized algorithms that examine the cluster state to find the most suitable resources that improve both task performance and cluster uti...
متن کاملReconciling High-Speed Scheduling with Dispatching in Wafer Fabs
A novel approach to reconciling global optimizing scheduling with real-time dispatching is presented. High speed scheduling is accomplished by two simultaneous cooperating scheduling processes, one of which generates longer term schedules while the other is focused with higher resolution on the nearer term. Real-time dispatching is based on the most recently generated near term schedule, carefu...
متن کاملManaging and Scheduling Approximate Applications to Utilize Renewable Energy in Cloud Computing Datacenters
With the rapid development of cloud computing, the power consumption of datacenters is getting much higher in recent years. Renewable energy becomes promising as the power supply for datacenters since it helps to greatly save the consumption of traditional power resources. In order to improve the utilization of renewable energy in data centers, we proposed several adaptive scheduling algorithms...
متن کاملQoS-Aware Admission Control in Heterogeneous Datacenters
Large-scale datacenters (DCs) host tens of thousands of diverse applications each day. Apart from determining where to schedule workloads, the cluster manager should also decide when to constrain application admission to prevent system oversubscription. At the same time datacenter users care not only for fast execution time but for low waiting time (fast scheduling) as well. Recent work has add...
متن کاملData-Aware Scheduling in Datacenters
Title of dissertation: DATA-AWARE SCHEDULING IN DATACENTERS Manish Purohit, Doctor of Philosophy, 2016 Dissertation directed by: Professor Samir Khuller Department of Computer Science Datacenters have emerged as the dominant form of computing infrastructure over the last two decades. The tremendous increase in the requirements of data analysis has led to a proportional increase in power consump...
متن کامل